Robust estimation of the false discovery rate

Authors

  • Stan Pounds
  • Cheng Cheng
Abstract

MOTIVATION: Presently available methods that use p-values to estimate or control the false discovery rate (FDR) implicitly assume that p-values are continuously distributed and based on two-sided tests. Therefore, it is difficult to reliably estimate the FDR when p-values are discrete or based on one-sided tests.

RESULTS: A simple and robust method to estimate the FDR is proposed. The proposed method does not rely on implicit assumptions that tests are two-sided or yield continuously distributed p-values. The proposed method is proven to be conservative and to have desirable large-sample properties. In addition, the proposed method was among the best performers across a series of 'real data simulations' comparing the performance of five currently available methods.

AVAILABILITY: Libraries of S-plus and R routines to implement the method are freely available from www.stjuderesearch.org/depts/biostats.
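For readers unfamiliar with p-value-based FDR estimation, the general idea can be sketched as follows. This is a generic illustration, not the authors' S-plus or R routines: the function names `estimate_pi0` and `estimate_fdr` are hypothetical, and the null-proportion estimate min(1, 2 * mean(p)) is an assumption chosen for simplicity. It is conservative when null p-values are uniform on [0, 1], because the mean of all p-values is then at least half the null proportion. That uniformity holds only for continuous p-values, which is exactly the kind of implicit assumption the abstract says breaks down for discrete or one-sided tests.

```python
import numpy as np


def estimate_pi0(pvals):
    """Conservative estimate of the proportion of true null hypotheses.

    Assumption (not from the abstract): null p-values are Uniform(0, 1),
    so their mean is 0.5 and 2 * mean(p) cannot fall below pi0 on average.
    """
    p = np.asarray(pvals, dtype=float)
    return min(1.0, 2.0 * p.mean())


def estimate_fdr(pvals, threshold, pi0=None):
    """Plug-in FDR estimate for the rule 'reject when p <= threshold'.

    Expected false rejections are approximated by pi0 * m * threshold and
    divided by the observed number of rejections.
    """
    p = np.asarray(pvals, dtype=float)
    if pi0 is None:
        pi0 = estimate_pi0(p)
    rejected = int((p <= threshold).sum())
    if rejected == 0:
        return 0.0  # no rejections, so no false discoveries
    return min(1.0, pi0 * p.size * threshold / rejected)


if __name__ == "__main__":
    example = [0.001, 0.002, 0.003, 0.5, 0.9]  # made-up p-values
    print(estimate_pi0(example))
    print(estimate_fdr(example, threshold=0.01))
```

Passing `pi0=1.0` gives the more conservative estimate that treats every hypothesis as potentially null.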

Related articles

The False Discovery Rate in Simultaneous Fisher and Adjusted Permutation Hypothesis Testing on Microarray Data

Background and Objectives: In recent years, new technologies have produced large amounts of data, and in biology, microarray technology has also developed dramatically. Meanwhile, the Fisher test is used to compare the control group with two or more experimental groups and also to detect the differentially expressed genes. In this study, the false discovery rate was investiga...

A windowed local fdr estimator providing higher resolution and robust thresholds

Motivation: In microarray analysis, special consideration must be given to the issues of multiple statistical tests, and typically p-values are adjusted to control family-wise error rate (FWER) or false discovery rate (FDR). FDR metrics have been suggested for controlling false positives; however, genes with p-values close to the threshold typically have a higher chance of being false positives ...

Université Paris Diderot — Paris 7

This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data for cancer research. In the first part, we study asymptotic properties of multiple testing procedures that aim at controlling the False Discovery Rate (FDR), that is, the expected False Discovery Proportion (FDP) among rejected hypotheses. We develop a versatile formalism to calculate the asympto...

Estimation of False Discovery Rate Using Permutation P-Values with Different Discrete Null Distributions

The false discovery rate (FDR) is a multiple testing error rate which describes the expected proportion of expected type I errors among the total number of rejected hypotheses. Benjamini and Hochberg introduced this quantity and provided an estimator that is conservative when the number of true null hypotheses, m0, is smaller than the number of tests, m. Replacing m with m0 in Benjamini and Hoc...

A mixture model for estimating the local false discovery rate in DNA microarray analysis

MOTIVATION Statistical methods based on controlling the false discovery rate (FDR) or positive false discovery rate (pFDR) are now well established in identifying differentially expressed genes in DNA microarray. Several authors have recently raised the important issue that FDR or pFDR may give misleading inference when specific genes are of interest because they average the genes under conside...

Journal:
  • Bioinformatics

Volume 22, Issue 16

Pages: -

Publication date: 2006